Scaling up the Accuracy of K -nearest-neighbour Classifiers: a Naive-bayes Hybrid

نویسندگان

  • L. Jiang
  • D. Wang
  • Z. Cai
  • S. Jiang
  • X. Yan
  • Weimin Zheng
چکیده

k-nearest-neighbour (KNN) has been widely used as an effective classification model. In this paper, we summarize three main shortcomings confronting KNN and then single out three categories of approaches for overcoming its three main shortcomings. After reviewing some algorithms in each category, we presented a hybrid algorithm called dynamic k-nearest-neighbour naive Bayes with attribute weighting (simply DKNAW) by combining three improved approaches. We conduct extensive empirical comparison for the related algorithms in four groups, using the whole 36 UCI data sets selected by Weka. In the first three groups, we compare some algorithms in each category accordingly. In the forth group, we compare our hybrid approach to each single approach. At last, we discuss some directions for our future work on KNN

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance Evaluation of Multistage Classifier

Ensemble of classifiers is one of the most researched methods in pattern classification in recency. It’s a well-known fact that multiple phases for evaluation provides more accuracy. In this paper we proposed a multistage classifier approach where we are applying three supervised classifiers for the classification in pattern recognition. Three Classifiers are Multilayer Perceptron (MLP), K-Near...

متن کامل

Scaling Up the Accuracy of Naive-Bayes Classifiers: A Decision-Tree Hybrid

Naive-Bayes induction algorithms were previously shown to be surprisingly accurate on many classii-cation tasks even when the conditional independence assumption on which they are based is violated. However , most studies were done on small databases. We show that in some larger databases, the accuracy of Naive-Bayes does not scale up as well as decision trees. We then propose a new algorithm, ...

متن کامل

Generating Estimates of Classification Confidence for a Case-Based Spam Filter

Producing estimates of classification confidence is surprisingly difficult. One might expect that classifiers that can produce numeric classification scores (e.g. k-Nearest Neighbour or Naive Bayes) could readily produce confidence estimates based on thresholds. In fact, this proves not to be the case, probably because these are not probabilistic classifiers in the strict sense. The numeric sco...

متن کامل

Attack Type Prediction Using Hybrid Classifier

Due to the rapid increase in terrorist activities throughout the world, there is serious intention required to deal with such activities. There must be a mechanism that can predict what kind of “attack types” can happen in future and important measures can be taken out accordingly. In this paper, a hybrid classifier is proposed which consists of some existing classifiers including K Nearest Nei...

متن کامل

Comparison of Classification Methods: Peril to Avoid for Binary and Multi Propose Combination Approach

ABSTRACT: Classification plays an important role in various fields like Object recognition, text categorization etc. Studying classifiers for purpose of estimating probability for a ce is crucial for classification .In this paper, we present a survey of four k Nearest Neighbour, Naive Bayes and Neural Network focusing on their merits and demerits.We will also shed light on combination of the ab...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009